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CROSS-REFERENCE TO RELATED APPLICATIONS 
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xxxxx 

FIELD OF THE INVENTION 

The present invention is directed to a novel human DNA 
15 sequence encoding a motilin receptor, the receptor encoded by the 
DNA, and the uses thereof. 

BACKGROUND OF THE INVENTION 

Gastrointestinal (GI) motility is a coordinated neuromuscular 
20 process which transports nutrients through the digestive system. 

Impaired GI motility, can lead to irritable bowel syndrome, constipation 
and diabetic and post-surgical gastroporesis and is one of the largest 
health care burdens of industrialized nations. Motilin, a 22 amino acid 
prokinetic peptide is expressed throughout the gastrointestinal tract in a 
25 number of species including humans. Released from endochromafffin 
cells of the small intestine, motilin exerts a profound effect on gastric 
motility with the induction of interdigestive (phase HI) antrum and 
duodenal contractions. The unrelated macrolide antibiotic 
erythromycin also possesses prokinetic properties mediated by its 
30 interaction with motilin receptors. These account for erythromycin's 
GI side-effects, including vomiting, nausea, diarrhea and abdominal 
muscular discomfort. 

Motilin receptors have been detected in the GI tract and recently 
in the central nervous system, but their molecular structure has not been 
35 reported. Although motilin receptor characterization has been actively 
pursued in humans and other species since the isolation of motilin from 
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porcine intestine in 1972, the receptor itself has not been isolated nor 
cloned. 

Motilin is highly conserved across species and is synthesized as 
part of larger pre-prohormone. Mature 22 amino acid motilin is 

5 generated by removal of its secretory signal peptide and cleavage at the 
first C-teiminally located dibasic prohormone convertase recognition 
site. Using radioligand binding, autoradiography and in vitro biossays, 
high affinity and low density, motilin receptors were detected in smooth 
muscle cells of the gastrointestinal tract of humans, cats and rabbits. 

10 Cerebellar brain receptors for motilin were also described supporting 
the notion that motilin may act in the central nervous system. Native 
motilin receptors appear to be coupled to G proteins and activate the 
phosphlipase C signal tranduction pathway resulting in Ca2+ influx 
through L-type channels. 

15 The development of safe and selective motilin receptor agonists is 

likely to aid the treatment of disorders resulting from impaired GI 
motility. Thus, it would be desirable to be able to isolate, and clone the 
motilin receptor, and to use this in assays for agonists and antagonists. 

20 SUMMARY OF THE INVENTION 

The present invention is directed to a novel G -protein 
coupled receptor (GPCR), designated as motilin receptor. Two spliced 
forms of the motilin receptor were identified: MTL-R1A, which 
encodes a functional seven- transmembrane domain form, and MTL- 

25 RIB, which encodes a truncated five-transmembrane domain form. 
Both fonns make up embodiments of this invention. 

Another aspect of this invention are nucleic acids which 
encode the motilin receptor, which are isolated, or free from associated 
nucleic acids. 

30 Other aspects of this invention include assays for 

identifying motilin ligands which are agonists and antagonists of a 
motilin receptor comprising contacting a candidate ligand with a motilin 
receptor and determining if binding occurred. 

Another aspect of this invention is a method for 

35 determining whether a ligand is capable of binding to a motilin receptor 
comprising: 
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(a) transfecting test cells with an expression vector 
encoding motilin receptor; 

(b) exposing the test cells to the ligand; 

(c) measuring the amount of binding of the ligand 
5 to the motilin receptor; 

(d) comparing the amount of binding of the ligand 
to the motilin receptor in the test cells with the amount of binding of the 
ligand to control cells that have not been transfected with the motilin 
receptor 

10 where if the amount of binding of the ligand to the test cells 

is greater than the amount of binding of the ligand to the control cells, 
then the substance is capable of binding to motilin receptor. 

BRIEF DESCRIPTION OF THE FIGURES 
15 Figure 1 shows the DNA sequence of motilin receptor gene 

including 5' untranslated region (SEQ.ID.NO.rl). Lntronic sequences 

are shown in lower case type. 

Figure 2 shows the DNA sequence of motilin receptor 

spliced form A (MTL-R1 A) (SEQ.ID.NO.:2). 
20 Figure 3 shows deduced amino acid sequence of MTL-R1 A 

(SEQ.ID.NO.:3). 

Figure 4 shows the DNA sequence of motilin receptor 
spliced form B (MTL-R1B) (SEQ.ID.NO.:4). 

Figure 5 shows the deduced amino acid sequence of MTL- 
25 RIB (SEQJD.NO.:5). 

Figures 6 A-C compare DNA and protein sequence for 
MTL-R1A and MTL-R1B. 

Figure 7 shows the DNA sequence of puffer fish clone 
75E7 (SEQ.nXNO.:6). 
30 Figure 8 shows the deduced amino acid sequence of puffer 

fish clone 75E7 protein sequences (SEQ.ED.NO.:7). 

Figure 9 shows the comparison of human MTL-R1 A and 
puffer fish clone 75E7 protein sequences. 

Figure 10 is a graph illustrating the pharmacological 
35 characterization of the cloned MTL-R1 A in the aequorin 
bioluminescence assay in HEK-293 cells. 

-3- 
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Figure 1 1 is a graph illustrating the pharmacological 
characterization of the cloned MTL-R1A in the [125l]-Tyr7-human 
motilin binding assay. 

5 As used throughout the specification and claims, the 

following definitions apply: 

"Substantially free from other proteins" means at least 
90%, preferably 95%, more preferably 99%, and even more preferably 
99.9%, free of other proteins. Thus, for example, a MTL-R1 protein 

10 preparation that is substantially free from other proteins will contain, as 
a percent of its total protein, no more than 10%, preferably no more 
than 5%, more preferably no more than 1 %, and even more preferably 
no more than 0.1%, of non- MTL-R1 proteins. Whether a given MTL- 
Rl protein preparation is substantially free from other proteins can be 

15 determined by such conventional techniques of assessing protein purity 
as, e.g. 9 sodium dodecyl sulfate polyacrylamide gel electrophoresis 
(SDS-PAGE) combined with appropriate detection methods, e.g., silver 
staining or immunoblotting, 

"Substantially free from other nucleic acids" means at least 

20 90%, preferably 95%, more preferably 99%, and even more preferably 
99.9%, free of other nucleic acids. Thus, for example, a MTL-R1 DNA 
preparation that is substantially free from other nucleic acids will 
contain, as a percent of its total nucleic acid, no more than 10%, 
preferably no more than 5%, more preferably no more than 1%, and 

25 even more preferably no more than 0.1%, of non- MTL-R1 nucleic 

acids. Whether a given MTL-R1 DNA preparation is substantially free 
from other nucleic acids can be determined by such conventional 
techniques of assessing nucleic acid purity as, e.g., agarose gel 
electrophoresis combined with appropriate staining methods, e.g., 

30 ethidium bromide staining, or by sequencing. 

"Functional equivalent" means a receptor which does not 
have the exact same amino acid sequence of a naturally occurring 
motilin receptor, due to alternative splicing, deletions, mutations, or 
additions, but retains at least 1%, preferably 10%, and more preferably 

35 25% of the biological activity of the naturally occurring receptor. Such 
derivatives will have a significant homology with a motilin receptor and 

-4- 
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can be detected by reduced stringency hybridization with a DNA 
sequence obtained from a motilin receptor. The nucleic acid encoding a 
functional equivalent has at least about 50% homology at the nucleotide 
level to a naturally occurring receptor nucleic acid. 
5 "Ligand" means any molecule which binds to a motilin 

receptor of this invention. These ligands can have either agonist, partial 
agonist, partial antagonist or antagonist activity. 

DETAILED DESCRIPTION OF THE INVENTION 

10 The cloning of GPCR's related to the hypothalamic and 

pituitary receptor for the growth hormone (GH) secretagogues (GHSs) 
which mediate sustained pulsatile GH release has been recently 
described. (McKee el, aL, 1997 Genomics 46:426-434, which is hereby 
incorporated by reference). One of these clones, GPR38, possessed the 

15 most significant amino acid sequence identity to the human GHSR (52%) 
(rising to as high as 86% in transmembrane domains (TM). GPR38 was 
classified as an orphan GPCR (GPCRs for which a natural ligand has 
not been identified). 

GPR38 was isolated from a human genomic DNA library 

20 and contained a single intron of approximately 1 kb, as shown in 
FIGURE L cDNA clones were isolated to obtain the nucleotide 
sequence of correctly spliced GPR38 mRNA. Efforts to isolate cDNA 
clones by standard library screening proved unsuccessful. 

A combination of RACE and RT-PCR techniques resulted 

25 in the identification of two spliced forms for GPR38. These two 

GPR38 cDNAs use distinct splice donor sites and a common acceptor 
site (perfect match to consensus exon-intron splice acceptor junction 
sequence [pyrimidine-rich stretch ag/TG]). GPR38-A mRNA (imperfect 
match to consensus donor sequence [TGC/gt]) encodes a polypeptide of 

30 412 amino acids with seven alphahelical TM domains, the hallmark 
feature of GPC-Rs, whereas GPR38-B encodes a 363 amino acid 
polypeptide with five TM domains (perfect donor sequence [CCG/gt]). 
Northern blot analysis failed to reveal an expression profile for GPR38. 
However, when RNase protection was employed expression was 

35 demonstrated in stomach, thyroid and bone marrow. 
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It accordance with this invention, it has been found that 
GPR38 is the motiiin receptor. Thus, this invention is directed to the 
human motiiin receptor, its functional equivalents, motiiin receptors 
from other species which can be isolated using fragments of the human 
5 motiiin DNA as probes, and to splice varients of the motiiin receptor. 

The intact motiiin receptor of this invention was found to 
have structural features which are typical of G-protein linked receptors, 
including seven transmembrane (TM) domains, three intra- and 
extracellular loops, and the GPCR protein signature sequence. The TM 
10 domains and GPCR protein signature sequence are noted in the protein 
sequences of the GPCR in Figures 6A-C. 

A high-throughput assay was developed which measures 
Ca2+ realease with the bioluminescent Ca2+ sensitive-aequorin reporter 
protein (capable of measuring ligand-induced EP3-coupled mobilization 
15 of intracellular calcium and concomitant calcium-induced aequorin 
bioluminescence). Expression of cloned GPR38-A in eel] membranes 
was confirmed using epitope-tagged protein which revealed a single 
protein species with a molecular weight of approximately 45,000 
daltons containing an open reading frame encoding 412 amino acids 
20 (SEQ. ID.NO.:3). The DNA and deduced amino acid sequence are 
given in SEQ.ID, NCX:2 and SEQ.ID. NO.:3, respectively. 

A broad set of peptide and non-peptide molecules were 
tested at a single concentration in transiently transfected HEK-293/aeql7 
cells (100 nM peptide, 10 ^iM non-peptide). Significant bioluminescent 
25 responses were recorded for the peptide motiiin and the non-peptide 
macrolide erythromycin, which was reported to be a competitive 
agonist at motiiin receptors. Full dose-response curves confirmed this 
observation. 

Nucleotide sequence analysis revealed two splice fomis of 
30 human motiiin receptor both of which make up further aspects of this 
invention. The first (MTL-R1 A) encodes a seven transmembrane 
domain receptor. The full length open reading frame appears to contain 
412 amino acids. The second splice form (MTL-R1B) diverges in its 
nucleotide sequence from MTL-R1A just before the predicted amino 
35 acid of the sixth transmembrane domain (TM6). 
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In the MTL-R1B, TM5 is truncated and fused to a 
contiguous reading frame of about 86 amino acids, followed by a 
translation stop codon. The DNA and amino acids sequences encoding 
MTL-R1A and MTL-R1B are given in FIGURES 2-5. 
5 A further aspect of this invention is a related motilin 

receptor gene, evident in the teleost puffer fish Spheroides nephelus. 
Screening of a puffer fish genomic library identified a single clone 
(75E7) containing an open reading frame of 363 amino acids 
(approximately 54% identical at the protein level) which contains a 

10 similar exon-intron structure to GPR38. Analysis of clone 75E7 shows 
an amino acid sequence to contain 363 amino acids with a molecular 
weight of approximately 41,323 daltons. (FIGURE 8). DNA sequence 
of puffer fish clone 75E7 is given in SEQ.ID.NO.:6, and a comparison 
of human MTL-R 1 A and puffer fish clone 75E7 protein sequences is 

15 given in FIGURE 9. 

Another aspect of this invention relates to vectors which 
comprise nucleic acids encoding a motilin receptor or a functional 
equivalent. These vectors may be comprised of DNA or RNA; for most 
c!6ning purposes DNA vectors are preferred. Typical vectors include 

20 plasmids, modified viruses, bacteriophage and cosmids, yeast artificial 
chromosomes and other forms of episomal or integrated DNA that 
encode a motilin receptor. It is well within the skill of the ordinary 
artisan to determine an appropriate vector for a particular gene transfer 
or other use. 

25 A further aspect of this invention are host cells which are 

transformed with a gene which encodes a motilin receptor or a 
functional equivalent. The host cell may or may not naturally express a 
motilin receptor on the cell membrane, Preferrably, once transformed, 
the host cells are able to express the motilin receptor or a functional 

30 equivalent on the cell membrane. Depending on the host cell, it may be 
desirable to adapt the DNA so that particular codons are used in order 
to optimize expression. Such adaptations are known in the art, and these 
nucleic acids are also included within the scope of this invention. 
Generally mammalian cell lines, such as HEK-293, COS , CHO, HeLa, 

35 NS/), CV-1, GC, GH3 or VERO cells are preferred host cells, but other 
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cells and cell lines such as Xenopus oocytes or insect cells, may also be 
used. 

Human embryonic kidney (HEK 293) cells and Chinese 
hamster ovary (CHO) cells are particularly suitable for expression of 
5 motilin receptor proteins because these cells express a large number of 
G-proteins. Thus, it is likely that at least one of these G-proteins will be 
able to functionally couple the signal generated by interaction of motilin 
receptors and their ligands, thus transmitting this signal to downstream 
effectors, eventually resulting in a measurable change in some assayable 

10 component, e.g., cAMP level, expression of a reporter gene, hydrolysis 
of inositol lipids, or intracellular Ca2+ levels. 

A variety of mammalian expression vectors can be used to 
express recombinant motilin in mammalian cells. Commercially 
available mammalian expression vectors which are suitable include, but 

15 are not limited to, pCR2.2 (Lnvitrogen), pMClneo (Stratagene), pSG5 
(Stratagene), pcDNAI and pcDNAIamp, pcDNA3, pcDNA3.1 , pCR3.1 
(lnvitrogen), EBO-pSV2-neo (ATCC 37593), pBPV- 1(8-2) (ATCC 
371 10), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 
37199). pRSVneo (ATCC 37198), and pSV2-dhfr (ATCC 37146). 

20 Following expression in recombinant cells, motilin receptors can be 
purified by conventional techniques to a level that is substantially free 
from other proteins. 

The specificity of binding of compounds showing 
affinity for motilin receptors is shown by measuring the affinity of 

25 the compounds for recombinant cells expressing the cloned receptor 
or for membranes from these cells. Expression of the cloned 
receptor and screening for compounds that bind to motilin receptors 
or that inhibit the binding of a known, radiolabeled ligand of motilin 
receptors to these cells, or membranes prepared from these cells, 

30 provides an effective method for the rapid selection of compounds 
with high affinity for a motilin receptor. Such ligands need not 
necessarily be radiolabeled but can also be nonisotopic compounds 
that can be used to displace bound radiolabeled compounds or that 
can be used as activators in functional assays. Compounds identified 

35 by the above method are likely to be agonists or antagonists of 
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motilin receptors and may be peptides, proteins, or non- 
proteinaceous organic molecules. 

Such molecules are useful in treating a variety of gastric 
conditions, including gastric motility disorders (intrinsic myopathies 
5 or neuropathy), functional defects, disorders which are secondary to 
neurologic disorders including spinal cord transections, amyloidosis, 
collagen vascular disease (e.g. scleroderma), paraneoplastic 
syndromes, radiation-induced dysmotility, diabetes, infections, 
stress-related motiliy disorders, psychgenic/functional disorders, 

10 other drugs which affect motility (e.g. beta andrenergic drugs which 
may delay gastric emptying, cholinergic agents or opiates, or 
serotonin receptor antagonists), gastroparesis (diabetic or post- 
surgical), gastro-esophageal reflux disease, constipation, chronic 
idiopathis pseudo-obstruction and acute fecal impaction, 

15 postoperative ileus, gallstones, infantile collie, preparation for 

colonoscopy and endoscopy, duodenal intubation , irritable bowel 
syndrome, non-ulcer dyspepsion, non-cardiac chest pain and 
diarrhea. 

The pharmacological characterization of the cloned MTL- 
20 Rl A in the aequorin bioluminescence assay in HEK-293 cells is shown 
in Figure 10 and in the [125l]-Tyr7 -human motilin binding assay 
(Figure 11). Motilin at concentrations as high as 10 ^iM gave no 
bioluminescent response above background levels in cells that were not 
transfected with the MTL-R1 A cDNA expression vector. Similarly, 
25 non -transfected cells did not show appreciable binding of [125l]-Tyr7- 
human motilin. 

The rank order of potency for motilin, motilin peptide 
fragments and non-peptide molecules is consistent with experiments 
performed on native motilin receptors, from stomach or intestinal 
30 tissues. 

Due to the high degree of homology to GPCRs, the motilin 
receptor of this invention is believed to function similarly to GPCRs and 
have similar biological activity. They are useful in understanding the 
biological and physiological pathways involved in gastrointestinal 
35 motility. They may be also used to scan for motilin agonists and 

antagonists; as in particular to test the specificity of identified ligands. 
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The following, non-limiting Examples are presented to 
better illustrate the invention. 

5 EXAMPLE! 

Sequence Comparison of MTL-R1 (GPR38) to human GHS-R, Puffer 
Fish 75E7 and Identification of Alternatively Spliced Forms. 

10 Inspection of the MTL-1 genomic DNA sequence revealed two 

potential mRNA splice sites corresponding to consensus boundaries for 
exon/intron junctions. An imperfect donor site (TGC/gt) was found at 
nucleotides 1929-31 (Fig. 1). a perfect donor site (CCG/gt) was found at 
nucleotides 2080-82, and a single perfect splice acceptor site (sequence 

15 [pyrimidine-rich stretch ag/TG]) was observed at nucleotides 2729-32. 
To determine which splice forms exist naturally, RACE f (rapid 
amplification of cDNA ends) was performed on thyroid poly (A)+ 
mRNA and RT-PCR (reverse transcriptase polymerase chain reaction) 
was conducted on HEK-293/aeql7 cells transfected with the MTL-1 

20 genomic DNA construct. Directional RACE reactions were conducted 
on thyroid poly (A)+ mRNA that had previously been shown by RNase 
protection assay to contain transcripts for MTL-1 R. Primer API 5'- 
CCA TCC TAA TAC GAC TCA CTA TAG GGC-3' (SEQJD.NO.:8) 
corresponds to the 5' end of the coding region including the 

25 presumptive Met initiation codon located within the cloning vector. 
5 RACE1 corresponds to the 3' end of the MTL-1 R coding region 
including the translation termination codon TAA. 5' RACE1: 5'-TTA 
TCC CAT CGT CTT CAC GTT AGC GCT TGT CTC-3' 
(SEQJD,NO.:9). 

30 RACE reactions were carried out on 1 |xg of thyroid poly (A)+ 

mRNA using the Marathon cDNA amplification/advantage PCR kit as 
per the manufacturer's instructions (Clontech) using the following 
Touchdown PCR amplification conditions: 94°C for 1 min., 5 cycles of 
94°C for 30 sec. and 72°C for 4 mim; 5 cycles of 94°C for 30 sec. and 

35 70°C for 4 mim; 25 cycles of 94 °C for 20 sec and 68°C for 4 min. An 
approximately 1 .4 kb amplified product was identified which hybridized 

-10- 
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with a 32p-iabeJed probe derived from the TM 2-4 region (3F/4R 
probe) of the MTL-R. This product was subcloned into PCR-Script 
vector (Irjvitrogen) and sequenced. 

As diagrammed in Figures 6A-C, DNA sequence analysis 
5 revealed two distinct sequences corresponding to alternative use of two 
splice donor sequences and a common splice acceptor sequence. These 
results were confirmed by transfecting the MTL-1 genomic construct 
containing the complete ORF interrupted by a single intron of 
approximately 0.7 kb into HEK-293/aeql7 cells. mRNA was the 

10 isolated (Poly (A) + Pure Kit, Ambion) and shown by Northern blot 
analysis using the 3F/4R probe to give two hybridizing bands: 2.4 kb 
containing the unspliced intron and approximately L4 kb encoding 
spliced forms. RT-PCR was then performed (Superscript 2 One-Step 
Kit, Life Technologies) on MTL-1 mRNA from transfected HEK- 

15 293/aeql7 cells using the forward primer 5' RACE1 and reverse primer 
3' RACE2 (TM5 region): 5-CTG CCC TTT CTG TGC CTC AGC 
ATC CTC TAC-3' (SEQ.ID.NO.:10) 

An approximately 500 bp product was cloned (TA vector 
pCR2.2, Invitrogen), sequenced and shown to be a mixture of both 

20 splice forms. Assembly of the complete ORF for MTL-1 A without 
intronic sequence was performed by ligation of an exon 1 fragment 
(Not I digestion of a MTL-1 plasmid containing the intron in pCDNA- 
3) to pCDNA-3.1 containing a Not 1/EcoRl exon 2 fragment. 

To document protein expression, an MTL-1 A plasmid encoding a 

25 amino-terminal FLAG epitope was constructed by ligation of a Pme 1 
fragment from the MTL-1 A/pcDNA-Ll vector into the EcoRV site of 
pFLAG/CMV-2 vector (Kodak Imaging Systems). Following 
transfection of this plasmid into HEK-293/aeq 1 7 cells, a protein of the 
expected size (approximately 48 kDa) was detected in crude cell 

30 membranes by immunoblot analysis. 

EXAMPLE 2 
Identification of Ligand Specific to Motilin Receptor 



35 



To identify a ligand for this orphan GPCR and to determine 
whether the full length, 7 TM domain GPR38-A is a functional GPCR, a 
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high-throughput assay was developed which measures Ca2+ release with 
the bioluminescent Ca2+ sensitive aequorin reporter protein (capable of 
measuring ligand-induced IP3-coupled mobilization of intracellular 

calcium and concomitant calcium-induced aequorin bioluminescence). 
5 Expression of GPR38-A in cell membranes was confirmed using 
epitope-tagged protein which revealed a single protein species with a 
molecular weight of approximately 45,000 daltons. 

A broad set of peptide and non-peptide molecules was tested at a 
single concentration in transiently transfected HEK-293/aeql7 cells (100 

10 nM peptide, 10 JJ.M non-peptide). Significant bioluminescent responses 
(> 4-fold over background) were recorded for the peptide motilin and 
the non-peptide macrolide erythromycin, which was reported to be a 
competitive agonist at motilin receptors. Full dose-response curves 
confirmed this observation. The half-maximal effective concentration 

15 (EC50) for human/porcine motilin was 2.1 +/- 0.5 nM motilin whereas 

erythromycin was considerably less potent (2000 +/- 210 nM; as 
expected from studies performed on native motilin receptors). 

The signal tranduction pathway for the cloned GPR38-A motilin 
receptor (MTL-R1A) is through activation of phospholipase C, which 

20 has been reported for native motilin receptors. Direct radioligand 

binding studies with [125l] human motilin on cell membranes prepared 
from transfected cells show that MTL-R1 A confers high affinity and 
specific binding (Kd= 0.1 nM; B ma x= 240 fmol/mg protein) which are 
strongly G protein coupled (> 80% inhibition of binding with 100 nM 

25 GTPyS) . 



EXAMPLE 3 
Functional Activation of the MTL-1 A Receptor 

30 

The aequorin bioluminescence assay is a reliable test for 
identifying G protein-coupled receptors which couple through the Got 
protein subunit family consisting of Gq and Gn which leads to the 

activation of phospholipase C, mobilization of intracellular calcium and 

35 activation of protein kinase C. Measurement of MTL-1 A expression in 

the aequorin-expressing stable reporter cell line 293-AEQ17 (Button, 
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D. et. al.,1993 Cell Calcium 14: p. 663-671.) was performed using a 
Luminoskan RT luminometer (Labsystems Inc., Gaithersburg, MD). 

293-AEQ17 cells (8 x 105 cells plated 18 hrs. before transfection 
in a T75 flask) were transfected with 22 ^ig of human MTL-R1 A 

5 plasmid DNA: 264 |ig lipofectamine. Following approximately 40 
hours of expression the apo-aequorin in the cells was charged for 4 
hours with coelenterazine (10 |iM) under reducing conditions (300 \iM 
reduced glutathione) in ECB buffer (140 mM NaCl, 20 mM KC1, 20 
mM HEPES-NaOH [pH=7.4], 5 mM glucose, 1 mM MgCl2, 1 mM 

10 CaCl2, 0.1 mg/ml bovine serum albumin). The cells were harvested, 

washed once in ECB medium and resuspended to 500,000 cells/ml. 100 
[i] of cell suspension (corresponding to 5x104 cells) was then injected 
into the test plate, and the integrated light emission was recorded over 
30 seconds, in 0.5 second units. 20 |iL of lysis buffer (0.1 % final 

15 Triton X-100 concentration) was then injected and the integrated light 
emission recorded over 10 seconds, in 0.5 second units. The "fractional 
response" values for each well were calculated by taking the ratio of the 
integrated response to the initial challenge to the total integrated 
luminescence including the Triton X-100 lysis response. 

20 

EXAMPLE 4 

Binding of [ 1 25l] Human Motilin to Crude Membranes from HEK-293 
Cells transfected with the MTL-R 1 A cDNA. 



25 The binding of [1251] human motilin to crude membranes 

prepared from HEK-293/aeql7 cell transfectants was performed as 
follows. Crude cell membranes were prepared on ice, 48 hrs. post- 
transfection. Each T-75 flask was washed twice with 10 ml of PBS, 
once with 1 ml homogenization buffer (50 mM Tris-HCl [pH 7.4], 10 

30 mM MgCl2- 10 ml of homogenization buffer was added to each flask, 
cells were removed by scraping and then homogenized using a Polytron 
device (Brinkmann, Syosset, NY; 3 bursts of 10 sec. at setting 4). The 
homogenate was centrifuged for 20 mim at 1 1 ,000 x g at 0°C and the 
resulting crude membrane pellet (chiefly containing cell membranes and 

35 nuclei) was resuspended in homogenization buffer supplemented with 
1 .5 % BSA (0.5 ml T75 flask) and kept on ice. 
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Binding reactions were performed at 20°C for 1 nr. in a total 
volume of 0.5 ml containing: 0.1 ml of membrane suspension 
(approximately 1 ug protein), 10 |il of 125]-human motilin, 10 |il of 
competing drug and 380-390 fj.1 of homogenization buffer. Bound 

5 radioligand was separated by rapid vacuum filtration (Brandel 48-well 
cell harvester) through GF/C filters pretreated for 1 hr. with 0.5% 
polyethylenimine. After application of the membrane suspension to the 
filter, the filters were washed 3 times with 3 ml each of ice-cold 50 mM 
Tris-HCl [pH 7.4], 10 mM MgCl2, and the bound radioactivity on the 

10 filters was quantitated by gamma counting. Specific binding (> 90% of 
total) is defined as the difference between total binding and non-specific 
binding conducted in the presence of 100 nJvl unlabeled human motilin. 
Competition binding data were analyzed by a nonlinear curve-fitting 
program (Prism V, version 2.0; GraphPad Software, San Diego, CA). 

15 Results shown are the means (+/- SEM) of triplicate determinationst 
Human motilin was radiolabeled with 125i a t 7Tyr to a specific activity 
of approximately 2000 Ci/mmol (Woods Assay, Portland, OR). 

Structure-function analysis suggest that the motilin peptide 
minimally contains an N-terminal region (amino acids 1-7) essential for 

20 activity, linked to a C-terminal alpha helical domain which stabilizes the 
N-terminal active site region activity. The rank order of potency of 
several motilin peptide analogs in the MTL1-A functional and binding 
assays correlates with their reported potency measured by in vitro 
contractility assays (Table 1 ) performed on native motilin receptors in 

25 intestinal tissue. These results are summarized in Table 1 below. 



Cloned MTL-1 A Receptor 
(human) 



Ligand 


Aequorin 


[1251] hmotilin 




Assay 


binding 




(EC50nM) 


(IC5OJ1M) 


human motilin 


2.1 


0.5 


(MTL) 






erythromycin 


2000 


427 


roxithromycin 


1950 


613 


metoclopramide 


> 10,000 


> 10,000 


cisapride 


> 10.000 


> 10,000 
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canine motilin 


4.4 


0.2 


Leu 13 MTL 


3.9 


0.2 


1-11 MTL 


138 


127 


1-12 MTL 


72 


14 


1-13 MTL 


3.8 


0.9 


1-19 MTL 


4.1 


0.3 


10-22 MTL 


> 10,000 


1100 



The unrelated prokinetic agents metoclopromide and cisapride 
which have affinity for dopamine and/or 5-HT receptors were inactive, 
even at high (10 |iM) doses. 

5 

EXAMPLE 5 
Southern Blot Analysis 

A genomic Southern blot (EcoRI and BamHl -digested DNA, 10 
10 ng/lane) was hybridized with the ORF of MTL-1 A. Post- 

hybridizational washing stringencies were at 55°C 4 X SSPE after which 
the filters were dried and exposed to X-ray film for 5 days at -70°C. 
Lambda Hind III DNA markers were (in kb), 23.1, 9.4, 6.6, 4.4, 2.3, 
2.1. Southern blot analysis conducted in a variety of mammalian and 
15 non-mammalian species revealed a simple hybridization pattern 
consistent with a single, conserved gene encoding MTL-1 A. 

EXAMPLE 6 
Puffer Fish Clone 75E7 

20 

Screening of a puffer fish genomic library identified a single 
clone (75E7) containing an open reading frame of 363 amino acids with 
approximately 54% protein sequence identity to the human MTL-R1 A 
In addition, 75E7 has a similar intron-exon structure to the human 
25 MTL-R1A. 75E7 may be the ortholog of the human MTL-R1 A. 
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EXAMPLE 7 
Ex pression of the MTL-R1 A Gene 

5 Transcripts of MTL-1 A were detected by RNase Protection 

Assay (RPA). Synthesis of high -specific activity radiolabeled antisense 
probes and the RPA was conducted using a kit (MAXlscript and 
HybSpeed RPA kits; Ambion, Austin, TX) essentially as described by 
the manufacturer. The anti-sense cRNA MTL-1 A probe was 

10 synthesized from a cDNA template encompassing nt 1234 to 1516 of the 
human MTL-1 A inserted behind the T7 promoter in pLitmus 28 (New 
England Biolabs, Beverly, MA). Digestion of the construct with Stu I 
generated a cRNA transcript approximately 340 nt in size with 
approximately 60 nt of vector sequence. Input poly A + mRNA 

15 (Clontech, Palo Alto, CA) was 5 g for the MTL-1 A probe and 0.1 |ig 
for a control human actin probe. Precipitated fragments were subjected 
to slab-gel electrophoresis (42 cm x 32 cm x 0.4 mm) in 5 % 
acrylamide/Tris-borate-EDTA buffer containing 8 M urea. The gels 
were fixed, dried and autoradiographed on film (X-Omat; Kodak, 

20 Rochester, NY) for 1-3 days (MTL-1 A) or 2 hrs. (actin). 

The distribution profile of MTL-1 A mRNA was examined in a 
panel of Gl and non-Gl human tissues. MTL-1 A mRNA could be 
detected in whole stomach (most prominently), thyroid, and bone 
marrow but was absent from several brain regions and other non-CNS 

25 tissues. 
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WHAT IS CLAIMED: 

1. A motilin receptor, substantially free from receptor- 
associated proteins. 

5 

2. A motilin receptor according to Claim 1 which is 

human. 

3. A motilin receptor according to Claim 2 which is 
10 MTL-R1 A having the amino acid sequence SEQ.ID.NO.:3. 

4. A motilin receptor according to Claim 3 having the 
nucleic acid sequence SEQ.ID.NO.:2. 

15 5. A motilin receptor according to Claim 2 which is 

MTL-R1B having the amino acid sequence SEQ.ID.NO.:5. 

6. A motilin receptor according to Claim 5 having the 
nucleic acid sequence SEQ.ID.NO.:4. 

20 

7. A motilin receptor according to Claim 6 which is 
75E7 having the amino acid sequence SEQ.ID.NO*:7. 

8. A method for determining whether a ligand is 
25 capable of binding to a motilin receptor comprising: 

(a) transfecting test cells with an expression vector 
encoding motilin receptor; 

(b) exposing the test cells to the ligand; 

(c) measuring the amount of binding of the ligand 
30 to the motilin receptor; 

(d) comparing the amount of binding of the ligand 
to the motilin receptor in the test cells with the amount of binding of the 
ligand to control cells that have not been transfected with the motilin 
receptor 
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where if the amount of binding of the ligand to the test cells 
is greater than the amount of binding of the ligand to the control cells, 
then the substance is capable of binding to motilin receptor. 
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TTGAAATTATCTGGTCACTGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGTCGA 

GGCGGGTGGACCACCTGGGGTCAGGAGTTCGAGACCAGGCTGGCCAACATGGCGAAACCCTGACTACA 

CAAAAAACACAAAATTTAGCCGGGGCTTGGGCGCTCCTGTGCTCCCAGCTACTCAGGAGGCTGAGGTG 

GGAGGACTGCTTGAGCCTGGGAGGTCGAGGCTGCAGTGAGCTGTGATCGCGCCACTTAAACTCCAGCC 

TGGACGACAGTGAGACCCTGTCTCAAGAAGAAAAAAAGAAAGAAAGAAAGAAAAAAAGAAAAAAAAGA 

AATTATTTGGTCAATTATATGGTCAGCTCCCTCCACCACTCGCGAATTTACAGAAGAGGAGAACTGGG 

CTGGGCGAGACCAGGACTAGCCCAAGATTACACAAGTTACTCGGTTGTAGAGCCAGGATTAGACAGGA 

GAGGCTCTAGATTCTGGTCTAGACTCCCCTCCTATTATTTAGCATTATGGCTTCCTGAGGATTACCAT 

GAGCCCTCCTCCACCGTCAAGCGGCAGCTACCAGCCACCAGACCAGATCCCTTCGAAGGTGCCCGGAG 

TACCAGACTGACAAAAGCGCCCGTACAGTGCTCAGTCCTGTAACCAAAGCTGTCTAGGGTGCAGACAT 

CGCTCACCGGACCGGGTAGGGCTCGTGCGCTAAGGGCGCCGGGTATTCCAGTTAGTGGAGAGGGAAGC 

GCCCTGGAACTGCATGGGCCCGGGAGAGGGCGCGGGAGCGGAGCATGGCCGGGCCGGGGCGGGCCGCG 

GCCGTGGGCGGAGACTGCGCGCAGCTAGCTCGGGAGCGCCTCGGAGCC QCCCCGCAGAGCCGCTTCT 

CGCGCCCCGCAGCGCAGCGCAGCGCTCCGCCGTCTGACCTGCCGCGCCCGCAGCGTGCGGGCTGGGAA 

AGGAGGCGCTCACCGAGAGGGACCACGCGCCAGGCTCCCAGCCCGACCCGGGACGCGGCGGCCGCGCG 

GAGCACCCATGGGCAGCCCCTGGAACGGCAGCGACGGCCCCGAGGGGGCGCGGGAGCCGCCGTGGCCC 

GCGCTGCCGCCTTGCGACGAGCGCCGCTGCTCGCCCTTTCCCCTGGGGGCGCTGGTGCCGGTGACCGC 

TGTGTGCCTGTGCCTGTTCGTCGTCGGGGTGAGCGGCAACGTGGTGACCGTGATGCTGATCGGGCGCT 

ACCGGGACATGCGGACCACCACCAACTTGTACCTGGGCAGCATGGCCGTGTCCGACCTACTCATCCTG 

CTCGGGCTGCCGTTCGACCTGTACCGCCTCTGGCGCTCGCGGCCCTGGGTGTTCGGGCCGCTGCTCTG 

CCGCCTGTCCCTCTACGTGGGCGAGGGCTGCACCTACGCCACGCTGCTGCACATGACCGCGCTCAGCG 

TCGAGCGCTACCTGGCCATCTGCCGCCCGCTCCGCGCCCGCGTCTTGGTCACCCGGCGCCGCGTCCGC 

GCGCTCATCGCTGTGCTCTGGGCCGTGGCGCTGCTCTCTGCCGGTCCCTTCTTGTTCCTGGTGGGCGT 

CGAGCAGGACCCCGGCATCTCCGTAGTCCCGGGCCTCAATGGCACCGCGCGGATCGCCTCCTCGCCTC 

TCGCCTCGTCGCCGCCTCTCTGGCTCTCGCGGGCGCCACCGCCGTCCCCGCCGTCGGGGCCCGAGACC 

GCGGAGGCCGCGGCGCTGTTCAGCCGCGAATGCCGGCCGAGCCCCGCGCAGCTGGGCGCGCTGCGTGT 

CATGCTGTGGGTCACCACCGCCTACTTCTTCCTGCCCTTTCTGTGCCTCAGCATCCTCTACGGGCTCA 

TCGGGCGGGAGCTGTGGAGCAGCCGGCGGCCGCTGCGAGGCCCGGCCGCCTCGGGGCGGGAGAGAGGC 

CACCGGCAGACCGTCCGCGTCCTGCgtaagtggagccgccgtggttccaaagacgcctgcctgcagtc 

cgccccgccggggaccgcgcaaacgctccctccccttcccctgctcgcccagctctgggcgccgcttc 

cagctcccttcctatttcgattccagcctccacccgccggtcattcccatcccccgagaaaaccatgt 

cctgtcccccaggagctctgggggaccccagggcgctttgagggtgggatccccggatccgattcagt 

aaccagcagtgcttttccagagcctctgagaccagaaaggagagttggtaattcttaatccaaccacc 

tgttagatgccacaaatgaggagtcctcacagtgctcttgagaagacgagggagatttcattaagcta 

aaattttttatttaatgttaagtgatgctgaaggctaaagtaaaccttgctcgtatcaaaaagtaaag 

attgtgcagacctgttgtagaattcttttcaacagagaacagaaaacttgtctccgaagtgggtttgt 

ggaaggaagcctgccaaggcggcttgttcagagaaattgctccttctggtttatgtccagccttgata 

acacatatgggagcctactatgcagttttaaagcaagtatccatgcagcctgcagcctggtcattttt 

tctggggtgaggatctgcctaggtagaagttttctctaatttattttgctgttacttgttattgcaga 

tggttccttgtcggggtggggggtttatttgcttcccaatgcttttgttaatcccggtgctgtgtctt 

atgttgcagTGGTGGTGGTTCTGGCATTTATAATTTGCTGGTTGCCCTTCCACGTTGGCAGAATCATT 

TACATAAACACGGAAGATTCGCGGATGATGTACTTCTCTCAGTACTTTAACATCGTCGCTCTGCAACT 

TTTCTATCTGAGCGCATCTATCAACCCAATCCTCTACAACCTCATTTCAAAGAAGTACAGAGCGGCGG 

CCTTTAAACTGCTGCTCGCAAGGAAGTCCAGGCCGAGAGGCTTCCACAGAAGCAGGGACACTGCGGGG 

GAAGTTGCAGGGGACACTGGAGGAGACACGGTGGGCTACACCGAGACAAGCGCTAACGTGAAGACGAT 

GGGATAA 
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ATGGGCAGCCCCTGGAACGGCAGCGACGGCCCCGAGGGGGCGCGGGAGCCGCCGTGGCCCGCGCTG 

CCGCCTTGCGACGAGCGCCGCTGCTCGCCCTTTCCCCTGGGGGCGCTGGTGCCGGTGACCGCTGTG 

TGCCTGTGCCTGTTCGTCGTCGGGGTGAGCGGCAACGTGGTGACCGTGATGCTGATCGGGCGCTAC 

CGGGACATGCGGACCACCACCAACTTGTACCTGGGCAGCATGGCCGTGTCCGACCTACTCATCCTG 

CTCGGGCTGCCGTTCGACCTGTACCGCCTCTGGCGCTCGCGGCCCTGGGTGTTCGGGCCGCTGCTC 

TGCCGCCTGTCCCTCTACGTGGGCGAGGGCTGCACCTACGCCACGCTGCTGCACATGACCGCGCTC 

AGCGTCGAGCGCTACCTGGCCATCTGCCGCCCGCTCCGCGCCCGCGTCTTGGTCACCCGGCGCCGC 

GTCCGCGCGCTCATCGCTGTGCTCTGGGCCGTGGCGCTGCTCTCTGCCGGTCCCTTCTTGTTCCTG 

GTGGGCGTCGAGCAGGACCCCGGCATCTCCGTAGTCCCGGGCCTCAATGGCACCGCGCGGATCGCC 

TCCTCGCCTCTCGCCTCGTCGCCGCCTCTCTGGCTCTCGCGGGCGCCACCGCCGTCCCCGCCGTCG 

GGGCCCGAGACCGCGGAGGCCGCGGCGCTGTTCAGCCGCGAATGCCGGCCGAGCCCCGCGCAGCTG 

GGCGCGCTGCGTGTCATGCTGTGGGTCACCACCGCCTACTTCTTCCTGCCCTTTCTGTGCCTCAGC 

ATCCTCTACGGGCTCATCGGGCGGGAGCTGTGGAGCAGCCGGCGGCCGCTGCGAGGCCCGGCCGCC 

TCGGGGCGGGAGAGAGGCCACCGGCAGACCGTCCGCGTCCTGCTGGTGGTGGTTCTGGCATTTATA 

ATTTGCTGGTTGCCCTTCCACGTTGGCAGAATCATTTACATAAACACGGAAGATTCGCGGATGATG 

TACTTCTCTCAGTACTTTAACATCGTCGCTCTGCAACTTTTCTATCTGAGCGCATCTATCAACCCA 

ATCCTCTACAACCTCATTTCAAAGAAGTACAGAGCGGCGGCCTTTAAACTGCTGCTCGCAAGGAAG 

TCCAGGCCGAGAGGCTTCCACAGAAGCAGGGACACTGCGGGGGAAGTTGCAGGGGACACTGGAGGA 

GACACGGTGGGCTACACCGAGACAAGCGCTAACGTGAAGACGATGGGATAA 

FIG. 2 
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MGSPWNGSDGPEGAREPPWPALPPCDERRCSPFPLGALVPVTAVCLCLFVVGVSGNVVTVMLIGRY 
RDMRTTTNLYLGSMAVSDLLILLGLPFDLYRLWRSRPWVFGPLLCRLSLYVGEGCTYATLLHMTAL 
SVERYLAICRPLRARVLVTRRRVRALIAVLWAVALLSAGPFLFLVGVEQDPGISVVPGLNGTARIA 
SSPLASSPPLWLSRAPPPSPPSGPETAEAAALFSRECRPSPAQLGALRVMLWVTTAYFFLPFLCLS 
ILYGLIGRELWSSRRPLRGPAASGRERGHRQTVRVLLVVVLAFI ICWLPFHVGRIIYINTEDSRMM 
YFSQYFNIVALQLFYLSASINPILYNLISKXYRAAAFKLLLARKSRPRGFHRSRDTAGEVAGDTGG 
DTVG YTETSANV KTMG 
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ATGGGCAGCCCCTGGAACGGCAGCGACGGCCCCGAGGGGGCGCGGGAGCCGCCGTGGCCCGCGCTG 

CCGCCTTGCGACGAGCGCCGCTGCTCGCCCTTTCCCCTGGG&GCGCTGGTGCCGGTGACCGCTGTG 

TGCCTGTGCCTGTTCGTCGTCGGGGTGAGCGGCAACGTGGTGACCGTGATGCTGATCGGGCGCTAC 

CGGGACATGCGGACCACCACCAACTTGTACCTGGGCAGCATGGCCGTGTCCGACCTACTCATCCTG 

CTCGGGCTGCCGTTCGACCTGTACCGCCTCTGGCGCTCGCGGCCCTGGGTGTTCGGGCCGCTGCTC 

TGCCGCCTGTCCCTCTACGTGGGCGAGGGCTGCACCTACGCCACGCTGCTGCACATGACCGCGCTC 

AGCGTCGAGCGCTACCTGGCCATCTGCCGCCCGCTCCGCGCCCGCGTCTTGGTCACCCGGCGCCGC 

GTCCGCGCGCTCATCGCTGTGCTCTGGGCCGTGGCGCTGCTCTCTGCCGGTCCCTTCTTGTTCCTG 

GTGGGCGTCGAGCAGGACCCCGGCATCTCCGTAGTCCCGGGCCTCAATGGCACCGCGCGGATCGCC 

TCCTCGCCTCTCGCCTCGTCGCCGCCTCTCTGGCTCTCGCGGGCGCCACCGCCGTCCCCGCCGTCG 

GGGCCCGAGACCGCGGAGGCCGCGGCGCTGTTCAGCCGCGAATGCCGGCCGAGCCCCGCGCAGCTG 

GGCGCGCTGCGTGTCATGCTGTGGGTCACCACCGCCTACTTCTTCCTGCCCTTTCTGTGCCTCAGC 

ATCCTCTACGGGCTCATCGGGCGGGAGCTGTGGAGCAGCCGGCGGCCGCTGCGAGGCCCGGCCGCC 

TCGGGGCGGGAGAGAGGCCACCGGCAGACCGTCCGCGTCCTGCGTAAGTGGAGCCGCCGTGGTTCC 

AAAGACGCCTGCCTGCAGTCCGCCCCGCCGGGGACCGCGCAAACGCTGGGTCCCCTTCCCCTGCTC 

GCCCAGCTCTGGGCGCCGCTTCCAGCTCCCTTTCCTATTTCGATTCCAGCCTCCACCCGCCGTGGT 

GGTGGTTCTGGCATTTATAATTTGCTGGTTGCCCTTCCACGTTGGCAGAATCATTTACATAAACAC 

GGAAGATTCGCGGATGATGTACTTCTCTCAGTACTTTAACATCGTCGCTCTGCAACTTTTCTATCT 

GAGCGCATCTATCAACCCAATCCTCTACAACCTCATTTCAAAGAAGTACAGAGCGGCGGCCTTTAA 

ACTGCTGCTCGCAAGGAAGTCCAGGCCGAGAGGCTTCCACAGAAGCAGGGACACTGCGGGGGAAGT 

TGCAGGGGACACTGGAGGAGACACGGTGGGCTACACCGAGACAAGCGCTAACGTGAAGACGATGGG 

ATAA 
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MGSPWNGSDGPEGAREPPWPALPPCDERRCSPFPLGALVPVTAVCLCLFVVGVSGNVVIVMLIGRY 
RDMRTTTNLYLGSMAVSDLLILLGLPFDLYRLWRSRPWVFGPLLCRL5LYVGEGCTYATLLHMTAL 
SVERYLAICRPLRARVLVTRRRVRALIAVLWAVALLSAGPFLFLVGVEQDPG1SVVPGLNGTARIA 
SSPLASSPPLWLSRAPPPSPPSGPETAEAAALFSRECRPSPAQLGALRVMLWVTTAYFFLPFLCLS 
ILYGLIGRELWSSRRPLRGPAASGRERGHRQTVRVLRKWSRRGSKDACLQSAPPGTAQTLGPLPLL 
AQLWAPLPAPFPISIPASTRRGGGSGIYNLLVALPRWQNHLHKHGRFADDVLLSVL 



FIG. 5 
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ATGCCCTGGACCAGACCCCAGGTGGACCTCCATGCTGCTGCAGCAGAGACCAT6GACCAGTACACC 
ACGGACGACCACCACTACGAGGGCTCCCTCTTCCCCGCGTCCACCCTCATCCCCGTCACGGTCATC 
TGCATCCTCATCTTCGTGGTCGGCGTGACCGGCAACACCATGACCATCCTCATCATCCAGTACTTC 
AAGGACATGAAGACCACCACCAACCTGTACCTGTCCAGCATGGCCGTGTCCGACCTCGTCATCTTC 
CTCTGCCTGCCCTTCGACCTGTACCGCCTGTGGAAGTACGTGCCGTGGCTGTTCGGCGAGGCCGTG 
TGCCGCCTCTACCACTACATCTTCGAAGGCTGCACGTCGGCCACCATCCTCCACATCACGGCCCTG 
AGCATCGAGCGCTACCTGGCCATCAGCTTCCCCCTCAGGAGCAAGGTGATGGTGACCAGGAGAAGG 
GTCCAGTACATCATCCTGGCCCTGTGGTGCTTCGCCCTGGTGTCGGCCGCTCCCACGCTCTTCCTG 
GTCGGGGTGGAGTACGACAACGAGACGCACCCCGACTACAACACGGGCCAGTGCAAGCACACGGGC 
TACGCCATCAGCTCGGGGCAGCTGCACATCATGATCTGGGTGTCCACCACCTACTTCTTCTGCCCG 
ATGCTGTGTCTCCTCTTCCTCTACGGCTCCATCGGGTGCAAGCTGTGGAAGAGCAAGAACGACCTG 
CAGGGCCCGTGCGCCCTGGCCCGCGAGAGGTCGCACAGGCAAACGGTGAAGATCCTGGTGGTGGTG 
GTGCTGGCCTTCATCATCTGCTGGCTGCCCTACCACATCGGCAGGAACCTGTTCGCCCAGGTGGAC 
GACTACGACACGGCCATGCTCAGCCAGAATTTCAACATGGCCTCCATGGTGCTCTGCTACCTCAGC 
GCCTCCATCAACCCCGTCGTCTACAACCTGATGTCGAGGAAGTACCGGGCCGCCGCCAAGCGCCTC 
TTCCTGCTCCACCAGAGACCCAAGCCGGCCCACCGGGGGCAGGGGCAGTTTTGCATGATCGGCCAC 
AGCCCCACCCTGGACGAGAGCCTGACGGGGGTGTGA 



FIG. 7 
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MPWTRPQVDL HAAAAETMDQ YTTDDHHYEGSL FPASTL I PVTV I C I L I F W GVTGNT 
MTILIIQYFKDMKTTTNLYLSSMAVSDLVIFLCLPFDLYRLWKYVPWLFGEAVCRLY 
HYIFEGCTSATILHITALSIERYLAISFPLRSKVMVTRRRVQYI ILALWCFALVSAA 
PTLFLVGVEYDNETHPDYNTGQCKHTGYAISSGQLHIMIWVSTTYFFCPMLCLLFLY 
GSIGCKLWKSKNDLQGPCALARERSHRQTVKILVVVVLAFI ICWLPYHIGRNLFAQV 
DDYDTAMLSQNFNMASMVLCYLSAS I NPVVYNLMSRKYRAAAKRLFLLHQRPKPAHR 
GQGQFCMIGHSPTLDESLTGV 



FIG. 8 
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pu75E7 1 MPWTRPQVDLHAAAAETMDQYTTDDHHYEGSL FPASTL I PVTV IC I L I 48 

II I I I III hill :|: : 

huMTLR 1 MGSPWNGS . . DGPEGAREPPWPALPPCDERRCSPFPLGALVPVTAVCLCL 48 

49 F VVGVTGNTMTI L I IQYFKDMKTTTNLYLSSMAVSDLV I FLCLPFDL YRL 98 

|l I II .|:::| MINI Mill MMIIIIII 

49 FVVGVSGNVVTVMLIGRYRDMRTTTNLYLGSMAVSDLLILLGLPFDLYRL 98 

99 WKYVPWLFGEAVCRLYHYIFEGCTSATILHITALSIERYLAISFPLRSKV 148 

|: I -Ml I: I I II I 'II I ■ II 

99 WRSRPWVFGPLLCRLSLYVGEGCTYATLLHMTALSVERYLAICRPLRARV 148 

149 MVTRRRVQY 1 1 LALWCFAL VSAAPTLFLVGVE YD 182 

Mm M II MM I Ml I 

149 LVTRRRVRALIAVLWAVALLSAGPFLFLVGVEQDPGISVVPGLNGTARIA 198 

183 .... NETHPDYNTGQCKHTGYAISS GQLHIM 209 

I -I I I I 

199 SSPLASSPPLWLSRAPPPSPPSGPETAEAAALFSRECRPSPAQLGALRVM 248 

210 IWVSTTYFFCPMLCLLFLYGSIGCKLWKSKNDLQGPCALARERSHRQTVK 259 

I I Ml I I I I III II II I : IMI I III Mill: 
249 LWVTTAYFFLPFLCLSILYGLIGRELWSSRRPLRGPAASGRERGHRQTVR 298 

260 ILVVVVLAFIICWLPYHIGRNLFAQVDDYDTAMLSQNFNMASMVLCYLSA 309 

:|.|IMIIIIIIIMIMI :: M II II- I I I I I 

299 VLLVVVLAFI ICWLPFHVGRI I YINTEDSRMMYFSQYFNIVALQLFYLSA 348 

310 SINPVVYNIMSRRYRAAAKRLFLLHQ.RPKPAHRGQ. . .GQFCMIGHSPT 355 

||||:-IM-i:|l!lll : I I • I Ml I 

349 S I NP I LYNL ISKKYRAAAFKLLLARKSRPRGFHRSRDTAGE VAGDTGGDT 398 

356 LDESLTGV 363 

- • I 

399 VGYTETSANVKTMG 412 
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Rah way 



State 



NJ 



ZIP 



07065-0907 



Country 



USA 



Telephone (732)594- 1 273 



Fax 



(732)594-4720 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and 
the like so made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements 
may jeopardize the validity of the application or any patent issued thereon. 



Name of Sole or First Inventor: 



| | A petition has been filed for this unsigned inventor 



Given Name (first and middle \if any]) 




Family Name or Surname 



_ScotjJi_ 



Inventor's 
Signature 



Date 



Residence: 
City 



Holmdel 



USA 



Citizenship 



US 



Post Office 
Address 



Merck & Co., Inc., P.O Box 2000 



City 



Rahway 



State NJ 



ZIP 



07065-0907 



I I Additional inventors are being named on the supplemental Additional Inventors(s) sheet(s) PTO/SB/02A attached hereto. 



[Page 2 of 4 ] 
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Please type aplus sign (■¥) inside this box 



I 

^H^roved for use through 9/30/2000 OMB 651-0032 
SUBSTITUTE for^O/SB/02A (3-97), Declaration (Additional Inventors) 



DECLARATION AND POWER OF ATTORNEY 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 



3^ 



4 



Name of Additional Joint Inventor, if any: 


| | A petition has been filed for this unsigned inventor 


Given Name (first and middle \\f any]) 


Family Name or Surname 


Arthur A. A 


Patchett 


Inventor's 
Signature 




Date 




Residence: 
C.itv 


Westfield^ 


State 


nj siy 


Country 


USA 


Citizenship 


US 


Post Office 
Address 


Merck & Co., Inc., P.O. Box 2000 


City 


Rahway ^ 


State 


nj : 


LIP 


07065-0907 


Name of Additional Jc 


>int Inventor, if any: 


| | A petition has been filed for this unsigned inventor 


Given Name (first and middle [if anyl) 


Family Name or Surname 


Carina /j^C^ ^ 


Tan 


Inventor's 
Signature 




Date 


Hag 21 


Residence: 
Citv 


Metuchen ' 


State 


nj MX 


Country 


USA 


Citizenship 


Malaysian 


Post Office 
Address 




Merck & Co., Inc., P.O. Box 2000 


City 


Rahway I 


State 


NJ 


ZIP 


07065-0907 


Name of Additional J< 


jint Inventor, if any: 


| | A petition has been filed for this unsigned inventor 


Given Name (first and middle [if anyl) 


Family Name or Surname 


Karen Kulju 


McKee 


Inventor's 
Signature 




Date 




Residence: 
Citv 


Middletnwn 


State 


NJ MX 


^Country 


USA 


Citizenship 


US 


Post Office 
Address 




Merck & Co., Inc., P.O. Box 2000 


City 


Rahway 


State 


NJ 


ZIP 


07065-0907 


Name of Additional J 


oint Inventor, if any: 


| | A petition has been filed for this unsigned inventor 


Given Name (first and middle [if any]) 


Family Name or Surname 


Douglas _ 


.MacNeiL 


Inventor's 
Signature 




Date 




Residence: 
Citv 


^ - / 


State 


NJ MS 


Country 


USA 


Citizenship 


US 


Post Office 
Address 


Merck & Co., Inc., P.O. Box 2000 


City 


Rahway 


State 


NJ 


ZIP 


07065-0907 
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Please type a plus sign inside this box 



^H^roved for use through 9/30/2000. OMB 651-0032 
SUBSTITUTE forFTO/SB/02A (3-97), DeclaraUon (Additional Inventors) 



DECLARATION AND POWER OF ATTORNEY 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 



Name of Additional Joint Inventor, if any: 



[ 1 A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Inventor's 
Signature 



Howard 



Date 



Residence: 
City 



Park Ridge 



State 



NJ 



JUS 



Country 



USA 



Citizenship 



US 



Post Office 
Address 



Merck & Co., Inc., P.O. Box 2000 



City 



Rah way 



State NJ 



ZIP 



07065-0907 



Name of Additional Joint Inventor, if any: 



| | A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Sheng-Shung 

Inventor's 
Signature 



Pong 



2J A/*^ 2-oof 




Name of Additional Joint Inventor, if any: 



| | A petition has been filed for this unsigned inventor 



Given Name (first and middle \'\f any]) 



Family Name or Surname 




Smith 



State 



TX 



Country 



Date 



USA 



11° 



Citizenship 



US 



Post Office 
Address 



Merck & Co., Inc., P.O. Box 2000 



Rah way 



State NJ 



ZIP 



07065-0907 



Name of Additional Joint Inventor, if any: 



| | A petition has been filed for this unsigned inventor 



Given Name (first and middle [if any]) 



Family Name or Surname 



Inventor's 
Signature 



Date 



Residence: 
City 



State 



Country 



Citizenship 



Post Office 
Address 



Merck & Co., Inc., P O. Box 2000 



City 



Rah way 



State 



NJ 



ZIP 



07065-0907 



[Page 4 of 4 ] 

Computer generated form " Declaration Pwr of Atty" (Application Filing Folder), Merck & Co., Inc. 05/23/2000 



WO 99/64436 



PCTYUS99/12773 



SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
(i) APPLICANT: Merck & Co., Inc. 

(ii) TITLE OF INVENTION: CLONING AND IDENTIFICATION 

OF THE MOTILIN RECEPTOR 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merck & Co., Inc. 

(B) STREET: P.O. Box 2000, 126 E. Lincoln Ave. 

(C) CITY: Rahway 

(D) STATE: NJ 

(E) COUNTRY: USA 

(F) ZIP: 07065-0900 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA; 

(A) APPLICATION NUMBER: 60/089,098 

(B) FILING DATE: 12-JUN-1998 



(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Giesser, Joanne M 

(B) REGISTRATION NUMBER: 32,838 

(C) REFERENCE/ DOCKET NUMBER: 2 0251 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 732-594-3046 

(B) TELEFAX: 732-594-4720 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 




WO 99/64436 



PCT/US99/12773 



<xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TTGAAATTAT 
GAGGTCGAGG 
CGAAACCCTG 
CCAGCTACTC 
TGAGCTGTGA 
AGAAAAAAAG 
GGTCAGCTCC 
AGGACTAGCC 
CTCTAGATTC 
CCATGAGCCC 
AGGTGCCCGG 
GCTGTCTAGG 
GGGTATTCCA 
CGGGAGCGGA 
CTAGCTCGGG 
GCGCAGCGCT 



CTGGTCACTG 
CGGGTGGACC 
ACTACACAAA 
AGGAGGCTGA 
TCGCGCCACT 
AAAGAAAGAA 
CTCCACCACT 
CAAGATTACA 
TGGTCTAGAC 
TCCTCCACCG 
AGTACCAGAC 
GTGCAGACAT 
GTTAGTGGAG 
GCATGGCCGG 
AGCGCCTCGG 
CCGCCGTCTG 



CCGGGCGCGG TGGCTCACGC 
AC CTGGGGTC AGGAGTTCGA 
AAACACAAAA TTTAGCC GGG 
GGTGGGAGGA CTGCTTGAGC 
TAAACTCCAG CCTGGACGAC 
AGAAAAAAAG AAAAAAAAGA 
CGCGAATTTA CAGAAGAGGA 
CAAGTTACTC GGTTGTAGAG 
TCCCCTCCTA TTATTTAGCA 
TCAAGCGGCA GCTACCAGCC 
TGACAAAAGC GCCCGTACAG 
CGCTCACCGG ACCGGGTAGG 
AGGGAAGCGC CCTGGAACTG 
GCCGGGGCGG- GCCGCGGCCG 
AGCCCACCCC GCAGAGCCGC 
ACCTGCCGCG CCCGCAGCGT 



CTGTAATCCC AGCACTTTGG 

GACCAGGCTG GCCAACATGG 

GCTTGGGCGC TCCTGTGCTC 

CTGGGAGGTC GAGGCTGCAG 

AGTGAGACCC TGTCTCAAGA 

AATTATTTGG TCAATTATAT 

GAACTGGGCT GGGCGAGACC 

CCAGGATTAG ACAGGAGAGG 

TTATGGCTTC CTGAGGATTA 

AC C AG AC C AG ATCCCTTCGA 

TGCTCAGTCC TGTAACCAAA 

GCTCGTGCGC TAAGGGCGCC 

CATGGGCCCG GGAGAGGGCG 

TGGGC GG AG A CTGCGCGCAG 

TTCTCGCGCC CCGCAGCGCA 

GCGGGCTGGG AAAGGAGGCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



C TC AC C GAGA GGGACCACGC GCCAGGCTCC CAGCCCGACC CGGGACGCGG CGGCCGCGCG 102 0 

GAGCACCCAT GGGCAGCCCC TGGAAC GGC A GCGACGGCCC CGAGGGGGCG CGGGAGCCGC 108 0 

CGTGGCCCGC GCTGCCGCCT TGCGACGAGC GCCGCTGCTC GCCCTTTCCC CTGGGGGCGC 114 0 

TGGTGCCGGT GACCGCTGTG TGCCTGTGCC TGTTCGTCGT CGGGGTGAGC GGCAACGTGG 12 00 

TGACCGTGAT GCTGATCGGG CGCTACCGGG ACATGCGGAC CACCACCAAC TTGTACCTGG 126 0 

GCAGCATGGC CGTGTCCGAC CTACTCATCC TGCTCGGGCT GCCGTTCGAC CTGTACCGCC 13 2 0 

TCTGGCGCTC GCGGCCCTGG GTGTTCGGGC CGCTGCTCTG CCGCCTGTCC CTCTACGTGG 13 8 0 

GCGAGGGCTG CACCTACGCC ACGCTGCTGC ACATGACCGC GCTCAGCGTC GAGCGCTACC 144 0 

TGGC C ATCTG CCGCCCGCTC CGCGCCCGCG TCTTGGTCAC CCGGCGCCGC GTCCGCGCGC 1500 

TCATCGCTGT GCTCTGGGCC GTGGCGCTGC TCTCTGCCGG TCCCTTCTTG TTCCTGGTGG 156 0 

GCGTCGAGCA GGACCCCGGC ATCTCCGTAG TCCCGGGCCT CAATGGCACC GCGCGGATCG 162 0 

CCTCCTCGCC TCTCGCCTCG TCGCCGCCTC TCTGGCTCTC GCGGGCGCCA CCGCCGTCCC 168 0 

CGCCGTCGGG GCCCGAGACC GCGGAGGCCG CGGCGCTGTT CAGCCGCGAA TGCCGGCCGA 174 0 

GCCCCGCGCA GCTGGGCGCG CTGCGTGTCA TGC TGTGGGT CACCACCGCC TACTTCTTCC 180 0 

TGCCCTTTCT GTGCCTCAGC ATCCTCTACG GGCTCATCGG GCGGGAGCTG TGGAGCAGCC 1860 

GGCGGCCGCT GCGAGGCCCG GCCGCCTCGG GGCGGGAGAG AGGCCACCGG CAGACCGTCC 192 0 

GCGTCCTGCG TAAGTGGAGC CGCCGTGGTT CCAAAGACGC CTGCCTGCAG TCCGCCCCGC 198 0 

CGGGGACCGC GCAAACGCTG GGTCCCCTTC CCCTGCTCGC CCAGCTCTGG GCGCCGCTTC 2 04 0 

CAGCTCCCTC CTATTTCGAT TCCAGCCTCC ACCCGCCGGT ACTTCCCATC CCCCGAGAAA 210 0 

ACCATGTCCT GTCCCCCAGG AGCTCTGGGG GACCCCAGGG CGCTTTGAGG GTGGGATCCC 216 0 

CGGATCCGAT TCAGTAACCA GCAGTGCTTT TCCAGAGCCT CTGAGACCAG AAAGGAGAGT 222 0 

TGGTAATTCT TAATCCAACC ACCTGTTAGA TGCCACAAAT GAGGAGTCCT CACAGTGCTC 228 0 

TTGAGAAGAC GAGGGAGATT TCATTAAGCT AAAATTTTTT ATTTAATGTT AAGTGATGCT 234 0 

GAAGGCTAAA GTAAAC CTTG CTCGTATCAA AAAGTAAAGA TTGTGCAGAC CTGTTGTAGA 2 40 0 

ATTCTTTTCA ACAGAGAACA GAAAACTTGT CTCCGAAGTG GGTTTGTGGA AGGAAGCCTG 246 0 

CCAAGGCGGC TTGTTCAGAG AAATTGCTCC TTCTGGTTTA TGTCCAGCCT TGATAACACA 2 52 0 

TATGGGAGCC TACTATGCAG TTTTAAAGCA AGTATCCATG CAGCCTGCAG CCTGGTCATT 2 58 0 

TTTTCTGGGG TGAGGATCTG CCTAGGTAGA AGTTTTC TC T AATTTATTTT GCTGTTACTT 2 64 0 

GTTATTGCAG ATGGTTCCTT GTCGGGGTGG GGGGTTTATT TGCTTCCCAA TGC TTTTGTT 270 0 

AATCCCGGTG CTGTGTCTTA TGTTGCAGTG GTGGTGGTTC TGGCATTTAT AATTTGCTGG 276 0 

TTGCCCTTCC ACGTTGGCAG AATCATTTAC AT AAAC AC GG AAGATTCGCG GATGATGTAC 2 82 0 

TTCTCTCAGT ACTTTAACAT CGTCGCTCTG CAACTTTTCT ATCTGAGCGC ATCTATCAAC 2 88 0 

CCAATCCTCT ACAACCTCAT TTCAAAGAAG TACAGAGCGG CGGCCTTTAA ACTGCTGCTC 294 0 

GCAAGGAAGT CCAGGCCGAG AGGCTTCCAC AGAAGCAGGG ACACTGCGGG GGAAGTTGCA 3 00 0 

GGGGACACTG GAGGAGACAC GGTGGGCTAC ACCGAGACAA GCGCTAACGT GAAGACGATG 3 06 0 

GGATAA 3 06 6 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{ D ) TOPOLOGY : 1 mear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 

GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 120 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 24 0 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 3 00 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 3 60 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 420 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 480 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 540 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 720 

CAGCTGGGCG CGCTGCGTGT CATGCTGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 7 80 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGCGGGAGC TGTGGAGCAG CCGGCGGCCG 840 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCCACC GGCAGACCGT CCGCGTCCTG 900 

CTGGTGGTGG TTCTGGCATT TATAATTTGC TGGTTGCCCT TCCACGTTGG CAGAATCATT 960 

TACATAAACA CGGAAGATTC GCGGATGATG TACTTCTCTC AGTACTTTAA CATCGTCGCT 1020 

CTGCAACTTT TCTATCTGAG CGCATCTATC AACCCAATCC TCTACAACCT CATTTCAAAG 1080 

AAGTACAGAG CGGCGGCCTT TAAACTGCTG CTCGCAAGGA AGTCCAGGCC GAGAGGCTTC 1140 

CACAGAAGCA GGGACACTGC GGGGGAAGTT GCAGGGGACA CTGGAGGAGA CACGGTGGGC 12 00 

TACACCGAGA CAAGCGCTAA CGTGAAGACG ATGGGATAA 1239 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 412 amino acids 
{ B ) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Met Gly Ser Pro Trp Asn Gly Ser Asp Gly Pro Glu Gly Ala Arg Glu 

15 10 15 

Pro Pro Trp Pro Ala Leu Pro Pro Cys Asp Glu Arg Arg Cys Ser Pro 

20 25 30 

Phe Pro Leu Gly Ala Leu Val Pro Val Thr Ala Val Cys Leu Cys Leu 

35 40 45 

Phe Val Val Gly Val Ser Gly Asn Val Val Thr Val Met Leu lie Gly 

50 55 60 

Arg Tyr Arg Asp Met Arg Thr Thr Thr Asn Leu Tyr Leu Gly Ser Met 
65 70 75 80 
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Ala 


Val 


Ser 


Asp 


Leu 


Leu 


He 


Leu 


Leu 


Gly 


Leu 


Pro 


Phe 


Asp 


Leu 


Tyr 










85 










90 










95 




Arg 


Leu 


Trp 


Arg 


Ser 


Arg 


Pro 


Trp 


Val 


Phe 


Gly 


Pro 


Leu 


Leu 


Cys 


Arg 








100 










105 










110 






Leu 


Ser 


Leu 


Tyr 


Val 


Gly 


Glu 


Gly 


Cys 


Thr 


Tyr 


Ala 


Thr 


Leu 


Leu 


His 






115 










120 










125 








Met 


Thr 


Ala 


Leu 


Ser 


Val 


Glu 


Arg 


Tyr 


Leu 


Ala 


He 


Cys 


Arg 


Pro 


Leu 




130 










135 










140 










Arg 


Ala 


Arg 


Val 


Leu 


Val 


Thr 


Arg 


Arg 


Arg 


Val 


Arg 


Ala 


Leu 


He 


Ala 


145 










150 










155 










160 


Val 


Leu 


Trp 


Ala 


Val 


Ala 


Leu 


Leu 


Ser 


Ala 


Gly 


Pro 


Phe 


Leu 


Phe 


Leu 










165 










170 










175 




Val 


Gly 


Val 


Glu 


Gin 


Asp 


Pro 


Gly 


He 


Ser 


Val 


Val 


Pro 


Gly 


Leu 


Asn 








180 










185 










190 






Gly 


Thr 


Ala 


Arg 


He 


Ala 


Ser 


Ser 


Pro 


Leu 


Ala 


Ser 


Ser 


Pro 


Pro 


Leu 






195 










200 










205 








Trp 


Leu 


Ser 


Arg 


Ala 


Pro 


Pro 


Pro 


Ser 


Pro 


Pro 


Ser 


Gly 


Pro 


Glu 


Thr 




210 










215 










220 










Ala 


Glu 


Ala 


Ala 


Ala 


Leu 


Phe 


Ser 


Arg 


Glu 


Cys 


Arg 


Pro 


Ser 


Pro 


Ala 


225 










230 










235 










240 


Gin 


Leu 


Gly 


Ala 


Leu 


Arg 


Val 


Met 


Leu 


Trp 


Val 


Thr 


Thr 


Ala 


Tyr 


Phe 










245 










250 










255 




Phe 


Leu 


Pro 


Phe 


Leu 


Cys 


Leu 


Ser 


He 


Leu 


Tyr 


Gly 


Leu 


He 


Gly 


Arg 








260 










265 










270 






Glu 


Leu 


Trp 


Ser 


Ser 


Arg 


Arg 


Pro 


Leu 


Arg 


Gly 


Pro 


Ala 


Ala 


Ser 


Gly 






275 










280 










285 








Arg 


Glu 


Arg 


Gly 


His 


Arg 


Gin 


Thr 


Val 


Arg 


Val 


Leu 


Leu 


Val 


Val 


Val 




290 










295 










300 










Leu 


Ala 


Phe 


He 


He 


Cys 


Trp 


Leu 


Pro 


Phe 


His 


Val 


Gly 


Arg 


He 


He 


305 










310 










315 










320 


Tyr 


He 


Asn 


Thr 


Glu 


Asp 


Ser 


Arg 


Met 


Met 


Tyr 


Phe 


Ser 


Gin 


Tyr 


Phe 










325 










330 










335 




Asn 


lie 


Val 


Ala 


Leu 


Gin 


Leu 


Phe 


Tyr 


Leu 


Ser 


Ala 


Ser 


He 


Asn 


Pro 








340 










345 










350 






He 


Leu 


Tyr 


Asn 


Leu 


He 


Ser 


Lys 


Lys 


Tyr 


Arg 


Ala 


Ala 


Ala 


Phe 


Lys 






355 










360 










365 








Leu 


Leu 


Leu 


Ala 


Arg 


Lys 


Ser 


Arg 


Pro 


Arg 


Gly 


Phe 


His 


Arg 


Ser 


Arg 




370 










375 










380 










Asp 


Thr 


Ala 


Gly 


Glu 


Val 


Ala 


Gly 


Asp 


Thr 


Gly 


Gly 


Asp 


Thr 


Val 


Gly 


385 










390 










395 










400 


Tyr 


Thr 


Glu 


Thr 


Ser 


Ala 


Asn 


Val 


Lys 


Thr 


Met 


Gly 











405 410 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
ATGGGCAGCC CCTGGAACGG C AGC GACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 
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GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 12 0 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 18 0 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 240 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 3 00 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 3 60 

TGCACCTACG CCACGCTGCT GC AC ATGAC C GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 420 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 48 0 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 54 0 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 72 0 

CAGCTGGGCG CGCTGCGTGT CATGCTGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 7 80 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGC GGGAGC TGTGG AGC AG CCGGCGGCCG 84 0 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCCACC GGCAGACCGT CCGCGTCCTG 900 

CGTAAGTGGA GCCGCCGTGG TTCCAAAGAC GCCTGCCTGC AGTCCGCCCC GCCGGGGACC 960 

GCGCAAACGC TGGGTCCCCT TCCCCTGCTC GCCCAGCTCT GGGCGCCGCT TCCAGCTCCC 1020 

TTTCCTATTT CGATTCCAGC CTCCACCCGC CGTGGTGGTG GTTCTGGCAT TTATAATTTG 1080 

CTGGTTGCCC TTCCACGTTG GCAGAATCAT TTACATAAAC AC GG AAGATT C GC GGATGAT 1140 

GTACTTCTCT CAGTACTTTA ACATCGTCGC TCTGCAACTT TTCTATCTGA GCGCATCTAT 12 0 0 

CAACCCAATC CTCTACAACC TCATTTCAAA GAAGTACAGA GCGGCGGCCT TTAAACTGCT 12 60 

GCTCGCAAGG AAGTCCAGGC CGAGAGGCTT C C AC AG AAGC AGGGACACTG CGGGGGAAGT 132 0 

TGCAGGGGAC ACTGGAGGAG ACACGGTGGG CTACACCGAG ACAAGCGCTA ACGTGAAGAC 13 8 0 

GATGGGATAA 13 90 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



Met 


Gly 


Ser 


Pro 


Trp 


Asn 


Gly 


Ser 


Asp 


Gly 


Pro 


Glu 


Gly 


Ala 


Arg 


Glu 


1 








5 










10 










15 




Pro 


Pro 


Trp 


Pro 


Ala 


Leu 


Pro 


Pro 


Cys 


Asp 


Glu 


Arg 


Arg 


Cys 


Ser 


Pro 








20 










25 










30 






Phe 


Pro 


Leu 


Gly 


Ala 


Leu 


Val 


Pro 


Val 


Thr 


Ala 


Val 


Cys 


Leu 


Cys 


Leu 






35 










40 










45 








Phe 


Val 


Val 


Gly 


Val 


Ser 


Gly 


Asn 


Val 


Val 


Thr 


Val 


Met 


Leu 


He 


Gly 




50 










55 










60 










Arg 


Tyr 


Arg 


Asp 


Met 


Arg 


Thr 


Thr 


Thr 


Asn 


Leu 


Tyr 


Leu 


Gly 


Ser 


Met 


65 










70 










75 










80 


Ala 


Val 


Ser 


Asp 


Leu 


Leu 


He 


Leu 


Leu 


Gly 


Leu 


Pro 


Phe 


Asp 


Leu 


Tyr 










85 










90 










95 




Arg 


Leu 


Trp 


Arg 


Ser 


Arg 


Pro 


Trp 


Val 


Phe 


Gly 


Pro 


Leu 


Leu 


Cys 


Arg 








100 










105 










110 






Leu 


Ser 


Leu 


Tyr 


Val 


Gly 


Glu 


Gly 


Cys 


Thr 


Tyr 


Ala 


Thr 


Leu 


Leu 


His 






115 










120 










125 








Met 


Thr 


Ala 


Leu 


Ser 


Val 


Glu 


Arg 


Tyr 


Leu 


Ala 


He 


Cys 


Arg 


Pro 


Leu 




130 










135 










140 










Arg 


Ala 


Arg 


Val 


Leu 


Val 


Thr 


Arg 


Arg 


Arg 


Val 


Arg 


Ala 


Leu 


He 


Ala 


145 










150 










155 










160 
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Val 


L6U 


Trp 


Ala 


Val 


Ala 


Leu 


Leu 


Ser 


Ala 


Gly 


Pro 


Phe 


Leu 


Phe 


Leu 










1 C c: 
lOD 










170 










175 




Val 


biy 


vai 


CjilU 


Gin 


Asp 


Pro 


Gly 


He 


Ser 


Val 


Val 


Pro 


Gly 


Leu 


Asn 








i on 
lou 










18 5 










190 






Liiy 


Thr 


Ala 


Arg 


lie 


Ala 


Ser 


Ser 


Pro 


Leu 


Ala 


Ser 


Ser 


Pro 


Pro 


Leu 






1 J J 










200 










205 








Trp 


Leu 


O ^ -v- 


Arg 


Ala 


Pro 


Pro 


Pro 


Ser 


Pro 


Pro 


Ser 


Gly 


Pro 


Glu 


Thr 




ziu 










215 










220 








Aid 


bill 


ai a 


Ala 


Ala 


Leu 


Phe 


Ser 


Arg 


Glu 


Cys 


Arg 


Pro 


Ser 


Pro 


Ala 


A AD 










OTA 










235 










240 




Leu 


Gly 


Ala 


Leu 


Arg 


Val 


Met 


Leu 


Trp 


Val 


Thr 


Thr 


Ala 


Tyr 


Phe 










OAK 
AQD 










250 










255 




Phe 


Leu 


Pro 


Phe 


Leu 


Cys 


Leu 


Ser 


He 


Leu 


Tyr 


Gly 


Leu 


He 


Gly 


Arg 








260 










265 










270 


bill 


Leu 


Trp 


Ser 


Ser 


Arg 


Arg 


Pro 


Leu 


Arg 


Gly 


Pro 


Ala 


Ala 


Ser 


Gly 






27 5 










280 










285 






Arg 


Glu 


Arg 


Gly 


His 


Arg 


Gin 


Thr 


Val 


Arg 


Val 


Leu 


Arg 


Lys 


Trp 


Ser 




290 










295 










300 








Arg 


Arg 


Gly 


Ser 


Lys 


Asp 


Ala 


Cys 


Leu 


Gin 


Ser 


Ala 


Pro 


Pro 


Gly 


Thr 


305 










310 










315 








320 


Ala 


Gin 


Thr 


Leu 


Gly 


Pro 


Leu 


Pro 


Leu 


Leu 


Ala 


Gin 


Leu 


Trp 


Ala 


Pro 










325 










330 








335 




Leu 


Pro 


Ala 


Pro 


Phe 


Pro 


He 


Ser 


He 


Pro 


Ala 


Ser 


Thr 


Arg 


Arg 


Gly 








340 










345 










350 




Gly 


Gly 


Ser 


Gly 


He 


Tyr 


Asn 


Leu 


Leu 


Val 


Ala 


Leu 


Pro 


Arg 


Trp 


Gin 






355 










360 










365 




Asn 


His 


Leu 


His 


Lys 


His 


Gly 


Arg 


Phe 


Ala 


Asp 


Asp 


Val 


Leu 


Leu 


Ser 




370 










375 










380 










Val 


Leu 






























385 

































(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: 

ATGCCCTGGA CCAGACCCCA GGTGGACCTC 

TACACCACGG ACGACCACCA CTACGAGGGC 

GTCACGGTCA TCTGCATCCT CATCTTCGTG 

CTCATCATCC AGTACTTCAA GGACATGAAG 

GCCGTGTCCG ACCTCGTCAT CTTCCTCTGC 

TACGTGCCGT GGCTGTTCGG CGAGGCCGTG 

TGCACGTCGG CCACCATCCT CCACATCACG 

AGCTTCCCCC TCAGGAGCAA GGTGATGGTG 

GCCCTGTGGT GCTTCGCCCT GGTGTCGGCC 

TACGACAACG AGACGCACCC C G AC T AC AAC 

ATCAGCTCGG GGCAGCTGCA CATCATGATC 

ATGCTGTGTC TCCTCTTCCT CTACGGCTCC 

GACCTGCAGG GCCCGTGCGC CCTGGCCCGC 



SEQ ID NO: 6: 

CATGCTGCTG CAGCAGAGAC CATGGACCAG 60 

TCCCTCTTCC CCGCGTCCAC CCTCATCCCC 12 0 

GTCGGCGTGA CCGGCAACAC CATGACCATC 18 0 

ACCACCACCA ACCTGTACCT GTCCAGCATG 24 0 

CTGCCCTTCG ACCTGTACCG CCTGTGGAAG 3 00 

TGCCGCCTCT ACCACTACAT CTTCGAAGGC 360 

GCCCTGAGCA TCGAGCGCTA CCTGGCCATC 42 0 

ACCAGGAGAA GGGTCCAGTA CATCATCCTG 480 

GCTCCCACGC TCTTCCTGGT CGGGGTGGAG 540 

ACGGGCCAGT GCAAGCACAC GGGCTACGCC 600 

TGGGTGTCCA CCACCTACTT CTTCTGCCCG 660 

ATCGGGTGCA AGCTGTGGAA GAGCAAGAAC 72 0 

GAGAGGTCGC ACAGGCAAAC GGTGAAGATC 78 0 
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CTGGTGGTGG TGGTGCTGGC CTTCATCATC TGCTGGCTGC CCTACCACAT CGGCAGGAAC 84 0 

CTGTTCGCCC AGGTGGACGA C T AC G AC AC G GCCATGCTCA GC C AG AATTT CAACATGGCC 9 00 

TCCATGGTGC TCTGCTACCT CAGCGCCTCC ATCAACCCCG TCGTCTACAA CCTGATGTCG 9 60 

AGGAAGTACC GGGCCGCCGC CAAGCGCCTC TTCCTGCTCC ACCAGAGACC CAAGCCGGCC 102 0 

CACCGGGGGC AGGGGCAGTT TTGCATGATC GGCCACAGCC CCACCCTGGA CGAGAGCCTG 10 80 

ACGGGGGTGT GA 10 92 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Met 


Pro 


Trp 


Thr 


Arg 


Pro 


Gin 


Val 


Asp 


Leu 


His 


Ala 


Ala 


Ala 


Ala 


Glu 


1 








5 










10 










15 




Thr 


Met 


Asp 


Gin 


Tyr 


Thr 


Thr 


Asp 


Asp 


His 


His 


Tyr 


Glu 


Gly 


Ser 


Leu 








20 










25 










30 






Phe 


Pro 


Ala 


Ser 


Thr 


Leu 


He 


Pro 


Val 


Thr 


Val 


He 


Cys 


He 


Leu 


He 






35 










40 










45 








Phe 


Val 


Val 


Gly 


Val 


Thr 


Gly 


Asn 


Thr 


Met 


Thr 


He 


Leu 


He 


He 


Gin 




50 










55 










60 










Tyr 


Phe 


Lys 


Asp 


Met 


Lys 


Thr 


Thr 


Thr 


Asn 


Leu 


Tyr 


Leu 


Ser 


Ser 


Met 


65 










70 










75 










80 


Ala 


Val 


Ser 


Asp 


Leu 


Val 


He 


Phe 


Leu 


Cys 


Leu 


Pro 


Phe 


Asp 


Leu 


Tyr 










85 










90 










95 




Arg 


Leu 


Trp 


Lys 


Tyr 


Val 


Pro 


Trp 


Leu 


Phe 


Gly 


Glu 


Ala 


Val 


Cys 


Arg 








100 










105 










110 






Leu 


Tyr 


His 


Tyr 


He 


Phe 


Glu 


Gly 


Cys 


Thr 


Ser 


Ala 


Thr 


He 


Leu 


His 






115 










120 










125 








He 


Thr 


Ala 


Leu 


Ser 


He 


Glu 


Arg 


Tyr 


Leu 


Ala 


He 


Ser 


Phe 


Pro 


Leu 




130 










135 










140 










Arg 


Ser 


Lys 


Val 


Met 


Val 


Thr 


Arg 


Arg 


Arg 


Val 


Gin 


Tyr 


He 


He 


Leu 


145 










150 










155 










160 


Ala 


Leu 


Trp 


Cys 


Phe 


Ala 


Leu 


Val 


Ser 


Ala 


Ala 


Pro 


Thr 


Leu 


Phe 


Leu 










165 










170 










175 




Val 


Gly 


Val 


Glu 


Tyr 


Asp 


Asn 


Glu 


Thr 


His 


Pro 


Asp 


Tyr 


Asn 


Thr 


Gly 








180 










185 










190 






Gin 


Cys 


Lys 


His 


Thr 


Gly 


Tyr 


Ala 


He 


Ser 


Ser 


Gly 


Gin 


Leu 


His 


He 






195 










200 










205 








Met 


He 


Trp 


Val 


Ser 


Thr 


Thr 


Tyr 


Phe 


Phe 


Cys 


Pro 


Met 


Leu 


Cys 


Leu 




210 










215 










220 










Leu 


Phe 


Leu 


Tyr 


Gly 


Ser 


He 


Gly 


Cys 


Lys 


Leu 


Trp 


Lys 


Ser 


Lys 


Asn 


225 










230 










235 










240 


Asp 


Leu 


Gin 


Gly 


Pro 


Cys 


Ala 


Leu 


Ala 


Arg 


Glu 


Arg 


Ser 


His 


Arg 


Gin 










245 










250 










255 




Thr 


Val 


Lys 


He 


Leu 


Val 


Val 


Val 


Val 


Leu 


Ala 


Phe 


He 


He 


Cys 


Trp 








260 










265 










270 






Leu 


Pro 


Tyr 


His 


He 


Gly 


Arg 


Asn 


Leu 


Phe 


Ala 


Gin 


Val 


Asp 


Asp 


Tyr 






275 










280 










285 
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Asp 


Thr 
290 


Ala 


Met 


Leu 


Ser 


Gin 
295 


Asn 


Phe 


Asn 


Met 


Ala 
300 


Ser 


Met 


Val 


Leu 


Cys 


Tyr 


Leu 


Ser 


Ala 


Ser 


He 


Asn 


Pro 


Val 


Val 


Tyr 


Asn 


Leu 


Met 


Ser 


305 










310 










315 










320 


Arg 


Lys 


Tyr 


Arg 


Ala 


Ala 


Ala 


Lys 


Arg 


Leu 


Phe 


Leu 


Leu 


His 


Gin 


Arg 










325 










330 










335 


Pro 


Lys 


Pro 


Ala 
340 


His 


Arg 


Gly 


Gin 


Gly 
345 


Gin 


Phe 


Cys 


Met 


He 
350 


Gly 


His 


Ser 


Pro 


Thr 
355 


Leu 


Asp 


Glu 


Ser 


Leu 
360 


Thr 


Gly 


Val 













(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCATCCTAAT ACGACTCACT ATAGGGC 27 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTATC C CATC GTCTTCACGT TAGCGCTTGT CTC 3 3 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTGCCCTTTC TGTGCCTCAG CATCCTCTAC 30 
(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATGGGCAGCC CCTGGAACGG CAGCGACGGC CCCGAGGGGG CGCGGGAGCC GCCGTGGCCC 60 

GCGCTGCCGC CTTGCGACGA GCGCCGCTGC TCGCCCTTTC CCCTGGGGGC GCTGGTGCCG 12 0 

GTGACCGCTG TGTGCCTGTG CCTGTTCGTC GTCGGGGTGA GCGGCAACGT GGTGACCGTG 180 

ATGCTGATCG GGCGCTACCG GGACATGCGG ACCACCACCA ACTTGTACCT GGGCAGCATG 2 40 

GCCGTGTCCG ACCTACTCAT CCTGCTCGGG CTGCCGTTCG ACCTGTACCG CCTCTGGCGC 3 00 

TCGCGGCCCT GGGTGTTCGG GCCGCTGCTC TGCCGCCTGT CCCTCTACGT GGGCGAGGGC 360 

TGCACCTACG CCACGCTGCT GCACATGACC GCGCTCAGCG TCGAGCGCTA CCTGGCCATC 42 0 

TGCCGCCCGC TCCGCGCCCG CGTCTTGGTC ACCCGGCGCC GCGTCCGCGC GCTCATCGCT 4 80 

GTGCTCTGGG CCGTGGCGCT GCTCTCTGCC GGTCCCTTCT TGTTCCTGGT GGGCGTCGAG 540 

CAGGACCCCG GCATCTCCGT AGTCCCGGGC CTCAATGGCA CCGCGCGGAT CGCCTCCTCG 600 

CCTCTCGCCT CGTCGCCGCC TCTCTGGCTC TCGCGGGCGC CACCGCCGTC CCCGCCGTCG 660 

GGGCCCGAGA CCGCGGAGGC CGCGGCGCTG TTCAGCCGCG AATGCCGGCC GAGCCCCGCG 72 0 

CAGCTGGGCG CGCTGCGTGT CATGCTGTGG GTCACCACCG CCTACTTCTT CCTGCCCTTT 7 80 

CTGTGCCTCA GCATCCTCTA CGGGCTCATC GGGCGGGAGC TGTGGAGCAG CCGGCGGCCG 840 

CTGCGAGGCC CGGCCGCCTC GGGGCGGGAG AGAGGCCACC GGCAGACCGT CCGCGTCCTG 900 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Met Gly Ser Pro Trp Asn Gly Ser Asp Gly Pro Glu Gly Ala Arg Glu 

15 10 15 

Pro Pro Trp Pro Ala Leu Pro Pro Cys Asp Glu Arg Arg Cys Ser Pro 

20 25 30 

Phe Pro Leu Gly Ala Leu Val Pro Val Thr Ala Val Cys Leu Cys Leu 

35 40 45 

Phe Val Val Gly Val Ser Gly Asn Val Val Thr Val Met Leu lie Gly 

50 55 60 

Arg Tyr Arg Asp Met Arg Thr Thr Thr Asn Leu Tyr Leu Gly Ser Met 
65 70 75 80 

Ala Val Ser Asp Leu Leu lie Leu Leu Gly Leu Pro Phe Asp Leu Tyr 

85 90 95 

Arg Leu Trp Arg Ser Arg Pro Trp Val Phe Gly Pro Leu Leu Cys Arg 

100 105 110 

Leu Ser Leu Tyr Val Gly Glu Gly Cys Thr Tyr Ala Thr Leu Leu His 

115 120 125 

Met Thr Ala Leu Ser Val Glu Arg Tyr Leu Ala lie Cys Arg Pro Leu 
130 135 140 
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Arg 


Ala 


Arg 


Val 


Leu 


Val 


Thr 


Arg 


Arg 


Arg 


Val 


Arg 


Ala 


Leu 


He 


Ala 


145 










150 










155 










160 


Val 


Leu 


Trp 


Ala 


Val 
165 


Ala 


Leu 


Leu 


Ser 


Ala 
170 


Gly 


Pro 


Phe 


Leu 


Phe 
175 


Leu 


Val 


Gly 


Val 


Glu 
180 


Gin 


Asp 


Pro 


Gly 


He 
185 


Ser 


Val 


Val 


Pro 


Gly 
190 


Leu 


Asn 


Gly 


Thr 


Ala 
195 


Arg 


lie 


Ala 


Ser 


Ser 
200 


Pro 


Leu 


Ala 


Ser 


Ser 
205 


Pro 


Pro 


Leu 


Trp 


Leu 
210 


Ser 


Arg 


Ala 


Pro 


Pro 

215 


Pro 


Ser 


Pro 


Pro 


Ser 
220 


Gly 


Pro 


Glu 


Thr 


Ala 


Glu 


Ala 


Ala 


Ala 


Leu 


Phe 


Ser 


Arg 


Glu 


Cys 


Arg 


Pro 


Ser 


Pro 


Ala 


225 










230 










235 










240 


Gin 


Leu 


Gly 


Ala 


Leu 
245 


Arg 


Val 


Met 


Leu 


Trp 
250 


Val 


Thr 


Thr 


Ala 


Tyr 

255 


Phe 


Phe 


Leu 


Pro 


Phe 
260 


Leu 


Cys 


Leu 


Ser 


He 
265 


Leu 


Tyr 


Gly 


Leu 


He 
270 


Gly 


Arg 


Glu 


Leu 


Trp 

275 


Ser 


Ser 


Arg 


Arg 


Pro 
280 


Leu 


Arg 


Gly 


Pro 


Ala 
285 


Ala 


Ser 


Gly 


Arg 


Glu 
290 


Arg 


Gly 


His 


Arg 


Gin 
295 


Thr 


Val 


Arg 


Val 


Leu 
300 











(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 

(B) TYPE : nucleic acid 

( C ) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

CGTAAGTGGA GCCGCCGTGG TTCCAAAGAC GCCTGCCTGC AGTCCGCCCC GCCGGGGACC 60 
GCGCAAACGC TGGGTCCCCT TCCCCTGCTC GCCCAGCTCT GGGCGCCGCT TCCAGCTCCC 12 0 

TTTCCTATTT CGATTCCAGC CTCCACCCGC CGGT 154 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 02 base pairs 

( B) TYPE : nucleic acid 

( C ) STRANDEDNESS : doubl e 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AGCTGGTGGT GGTTCTGGCA TTTATAATTT GCTGGTTGCC CTTCCACGTT GGCAGAATCA 60 

TTTACATAAA CACGGAAGAT TCGCGGATGA TGTACTTCTC TCAGTACTTT AACATCGTCG 120 

CTCTGCAACT TTTCTATCTG AGCGCATCTA TCAACCCAAT CCTCTACAAC CTCATTTCAA 180 

AGAAGTACAG AGCGGCGGCC TTTAAACTGC TGCTCGCAAG GAAGTCCAGG CCGAGAGGCT 240 

TCCACAGAAG CAGGGACACT GC GGGGGAAG TTGCAGGGGA C AC TGGAGGA GACACGGTGG 300 

GC T AC AC C G A GACAAGCGCT AACGTGAAGA CGATGGGATA ACGTAAGTGG AGCCGCCGTG 3 60 

GTTCCAAAGA CGCCTGCCTG CAGTCCGCCC CGCCGGGGAC CGCGCAAACG CTGGGTCCCC 42 0 
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TTCCCCTGCT CGCCCAGCTC TGGGCGCCGC TTCCAGCTCC CTTTCCTATT TCGATTCCAG 48 0 

CCTCCACCCG CCGTGGTGGT GGTTCTGGCA TTTATAATTT GCTGGTTGCC CTTCCACGTT 54 0 

GGCAGAATCA TTTACATAAA CACGGAAGAT TCGCGGATGA TGTACTTCTC TCAGTACTTT 600 

AA 602 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Leu 


Val 


Val 


Val 


Leu 


Ala 


Phe 


He 


He 


Cys 


Trp 


Leu 


Pro 


Phe 


His 


Val 


1 








5 










10 










15 




Gly 


Arg 


He 


He 
20 


Tyr 


He 


Asn 


Thr 


Glu 
25 


Asp 


Ser 


Arg 


Met 


Met 
30 


Tyr 


Phe 


Ser 


Gin 


Tyr 
35 


Phe 


Asn 


He 


Val 


Ala 
40 


Leu 


Gin 


Leu 


Phe 


Tyr 

45 


Leu 


Ser 


Ala 


Ser 


He 
50 


Asn 


Pro 


He 


Leu 


Tyr 
55 


Asn 


Leu 


He 


Ser 


Lys 
60 


Lys 


Tyr 


Arg 


Ala 


Ala 


Ala 


Phe 


Lys 


Leu 


Leu 


Leu 


Ala 


Arg 


Lys 


Ser 


Arg 


Pro 


Arg 


Gly 


Phe 


65 










70 










75 










80 


His 


Arg 


Ser 


Arg 


Asp 
85 


Thr 


Ala 


Gly 


Glu 


Val 
90 


Ala 


Gly 


Asp 


Thr 


Gly 
95 


Gly 


Asp 


Thr 


Val 


Gly 
100 


Tyr 


Thr 


Glu 


Thr 


Ser 
105 


Ala 


Asn 


Val 


Lys 


Thr 
110 


Met 


Gly 


Arg 


Lys 


Trp 
115 


Ser 


Arg 


Arg 


Gly 


Ser 
120 


Lys 


Asp 


Ala 


Cys 


Leu 
125 


Gin 


Ser 


Ala 


Pro 


Pro 
130 


Gly 


Thr 


Ala 


Gin 


Thr 
135 


Leu 


Gly 


Pro 


Leu 


Pro 
140 


Leu 


Leu 


Ala 


Gin 


Leu 


Trp 


Ala 


Pro 


Leu 


Pro 


Ala 


Pro 


Phe 


Pro 


He 


Ser 


He 


Pro 


Ala 


Ser 


145 










150 










155 










160 


Thr 


Arg 


Arg 


Gly 


Gly 
165 


Gly 


Ser 


Gly 


He 


Tyr 
170 


Asn 


Leu 


Leu 


Val 


Ala 
175 


Leu 


Pro 


Arg 


Trp 


Gin 
180 


Asn 


His 


Leu 


His 


Lys 
185 


His 


Gly 


Arg 


Phe 


Ala 
190 


Asp 


Asp 


Val 


Leu 


Leu 
195 


Ser 


Val 


Leu 
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